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Description 

Isolated Ordered Regions (IOR) Node 
Order 

Background of Invention 
Field of Invention 

[0001] The present invention relates generally to the field of or- 
dering nodes. More specifically, the present invention is 
related to ordering nodes within hierarchical data. 
Discussion of Prior Art 

[0002] a tree structure comprising nodes is a type of data struc- 
ture in which each element is attached to one or more el- 
ements directly beneath it. The connections among ele- 
ments in a tree structure are called branches. Trees are 
often called inverted trees because they are normally 
drawn with the root at the top. Inverted trees are the data 
structures used to represent hierarchical file structures. In 
this case, the leaves are files and the other elements 
above the leaves are directories. 



[0003] T ree structures have been used in prior art data process- 
ing systems to organize data. But, such prior art fails to 
provide for a system for ordering nodes within hierarchi- 
cal data in a mark-up language-based document. Addi- 
tionally, such prior art systems fail to teach a method or 
system for ordering nodes into a plurality of regions, 
wherein each of the regions defines an area within a two- 
dimensional space. 

[0004] The following references provide for a general teaching 

with regard to hierarchical structures, but they fail to pro- 
vide for the claimed invention's method and system. 

[0005] u.S. patent 5,970,489 discloses a method for using re- 
gion-sets to focus searches in hierarchical structures. The 
method improves a search in a hierarchical structure by 
focusing the search to selected regions within the struc- 
ture. The method defines one or more region-set(s) and 
uses the region-set(s) as either a filter for the results of a 
key-word search or an integrated part of a search engine 
to increase the efficiency of the search engine. The 
method also provides for dynamic creation of new region- 
set(s) from existing region-set(s) using a prescribed set of 
operators. 

[° 006 ] U.S. patent application publication 2002/0118214 Al 



provides a system and method for browsing node-link 
structures based on an estimated degree of interest. The 
disclosed invention enables a user to view large collec- 
tions of linked information on a computer-based display. 
A visualization is created which presents a representation 
of the complete collection of information on the display. 
The visualization fits completely within a fixed area of the 
computer-based display, negating the need to scroll in- 
formation into the display area. The visualization is based 
on identified focus nodes and through calculation of a 
Degree of Interest (DOI) for each of the nodes based in the 
structure. Layout and presentation of the visualization 
structure are based on the DOI values in combination with 
considerations of available display space. A user may dy- 
namically manipulate views of the structure by selecting 
one or more focus nodes, thus causing a recalculation of 
the degree of interest. 
[0007] us patent 6,359,635 provides for methods, articles, and 
an apparatus for visibly representing information and for 
providing an input interface. Display and/or input regions 
to represent a plurality of tree-related elements include: a 
region to represent a root element; at least two concave 
regions to represent at least two non-root, internal ele- 



merits; and at least two regions to represent at least two 
leaf elements. In a particular embodiment, a first concave 
region is radially adjacent to both a second concave re- 
gion and a third concave region, the second concave re- 
gion angularly adjacent to the third concave region, and a 
radial width of the second concave region differing from a 
radial width of the third concave region. 
[0008] whatever the precise merits, features, and advantages of 
the above cited references, none of them achieves or ful- 
fills the purposes of the present invention. 
Summary of Invention 

[0009] The present invention provides for a system and method 
for ordering a plurality of nodes associated with entities in 
a document, wherein the system comprises: (a) a node 
generator parsing the entities in a document and creating 
a plurality of nodes that represent the entities and rela- 
tionships that exists among the entities; (b) a node 
grouper grouping said created plurality of nodes into a 
plurality of regions, each of said regions defining an area 
within a two-dimensional space; and (c) a formatter for 
formatting said plurality of regions for storage in one or 
more pages. 

[0010] Modifications to nodes within a region only affect the 



nodes in that region - not nodes in other regions. Traver- 
sal that retrieve information from the nodes can rebase 
the coordinates from their containing region and return 
with a total order. 
[0011] The present invention also provides for a method for or- 
dering nodes in a document via isolated ordered regions, 
said method comprising the steps of: (a) parsing the doc- 
ument; (b) creating nodes representing entities of the 
document and a relationship that exist among the enti- 
ties; (c) mapping the created nodes based upon a level 
and step associated with each of the nodes; (d) grouping 
the mapped nodes into a plurality of regions, wherein the 
grouping identifies, for each of said regions, at least the 
following parameters: a minimum step, a minimum level, 
a maximum step, a maximum level, and the parameters 
give a region its dimension and order within said docu- 
ment; (e) ordering the regions based upon ascending 
minimum step and minimum level; (f) calculating step 
range associated with each of the regions; and (g) re- 
ordering the regions based upon ascending step range, 
minimum level, and minimum step, wherein the reorder- 
ing reflect parent-child relationships among the nodes. 
Brief Description of Drawings 



[0012] Figure 1 illustrates an overview of the system of the 

present invention. 
[0013] Figure 2 illustrates how, in a specific example, nodes from 

an XML document are extracted. 
[0014] Figures 3a-c illustrate differing sets of regions formed 

from a representative XML document. 
[0015] Figures 4a and 4c illustrate the next stage, wherein the 

regions as defined in Figures 3a and 3b are ordered and 

stored in pages 

[0016] Figures 5a and 5b illustrate a specific example depicting 
the mapping of nodes in a hierarchically structured docu- 
ment based upon steps and levels. 

[0017] Figure 6 illustrates mapped nodes that are grouped in a 
plurality of regions, i.e., Rl, R2, R3, R4, R5, R6, and R7. 

[0018] Figure 7 illustrates the relationships among the calculated 
step ranges and the various regions of Figure 6. 

[0019] Figure 8 illustrates an example that represents a variation 
of the example illustrated in Figure 6, wherein the varia- 
tion represents added nodes (and, therefore, added re- 
gions). 

[0020] Figure 9 illustrates an example that represents another 
variation of the example illustrated in Figure 6, wherein 
the variation represents different ordering of regions us- 



ing the same set of nodes. 
[0021] Figure 10 illustrates changes in parameters of regions due 

to changes in node structure. 
[0022] Figure 11 illustrates the effect of the addition of nodes. 

[0023] Figure 12 illustrates the effect of nesting levels on various 
mapped regions. 

[0024] Figure 13 illustrates reordering of regions of the docu- 
ment. 

[0025] Figures 14a-b illustrate original and region-relative coor- 
dinates associated with nodes in various regions. 

[0026] Figure 15 illustrates a scenario wherein post order traver- 
sal (POT#) numbers can be computed for each of the 
nodes during traversal to identify containment relation- 
ships among nodes. 

[0027] Figures 16a-b illustrate a set of nodes and their respec- 
tive mapping based on the method of assigning a step 
number every time a child node is descended. 

[0028] Figure 16c illustrates regions Rl through R6 formed from 
the set of nodes. 

[0029] Figures 16d-f illustrates the effect of modifications on the 
above-mentioned parameters with respect to regions Rl 
through R6. 

[0030] Figure 17 illustrates nodes grouped based upon node de- 



scendant regions. 
DESCRIPTION OF THE PREFERRED EMBODIMENTS 

[0031] while this invention is illustrated and described in a pre- 
ferred embodiment, the invention may be produced in 
many different configurations. There is depicted in the 
drawings, and will herein be described in detail, a pre- 
ferred embodiment of the invention, with the understand- 
ing that the present disclosure is to be considered an ex- 
emplification of the principles of the invention and the as- 
sociated functional specifications for its construction and 
is not intended to limit the invention to the embodiment 
illustrated. Those skilled in the art will envision many 
other possible variations within the scope of the present 
invention. 

[0032] The present invention presents a system and method for 
ordering nodes within hierarchical data using the concept 
of isolated ordered regions to maintain the coordinates of 
nodes by associating each node with coordinates relative 
to a containing region. Based upon the claimed invention, 
modifications to nodes within a region only affect the 
nodes in that region — not nodes in other regions. 
Traversals that retrieve information from the nodes can 
rebase the coordinates from their containing region and 



return results with a total order. 

[0033] Figure 1 illustrates an overview of the system 100 of the 
present invention. Document 102, containing hierarchical 
data (e.g., a document in a mark-up language-based for- 
mat such as XML), is stored by creating nodes, via node 
generator 104, that parses and represents the entities of 
the document and the relationships that exist among the 
entities. The nodes are grouped into regions (i.e., Region 
R i through R^) via region grouper 106 and are formatted 
via formatter 108 and written out to pages. As the nodes 
are created, they are grouped into regions in various ways 
depending upon anticipated access patterns and usage. 
Each region is formatted and stored into pages managed 
by the system. There can be more than one region written 
to a page. Figure 2 illustrates how, in a specific example, 
nodes from an XML document 200 are extracted. 

[0034] | t should be noted that although throughout the figures 
and specification an XML document has been used to il- 
lustrate the functionality associated with the system and 
method of the present invention, other hierarchically-or- 
dered documents, such as documents in mark-up lan- 
guage formats such HTML, can be equally used in con- 
junction with the present invention. Therefore, the scope 



of the present invention should not be limited by the type 
of hierarchically-ordered document. 
[0035] Figures 3a-c illustrate differing sets of regions formed 

from a representative XML document 200. In this example, 
a set of nodes representative of an XML document can be 
broken, in Figure 3a, into three regions 302, 304, and 306. 
Similarly, in Figures 3b and 3c, the same set of nodes is 
broken into regions 308-314 and 3i6-324,respectively. Fig- 
ures 4a and 4b illustrate the next stage, wherein the re- 
gions as defined in Figures 3a and 3b are ordered and 
stored in pages. 

[0036] Based upon the present invention, each region has a num- 
ber of values associated with it, wherein these values are 
computed using algorithms, to be outlined later. Each re- 
gion has a Minimum Step (Min Step), Minimum Level (Min 
Level), Maximum Step (Max Step), Maximum Level (Max 
Level), and Step Range. The Min Step, Min Level, Max Step, 
and Max Level numbers give a region its dimensions and 
order within the document. 

[0037] Figures 5a and 5b illustrate a specific example depicting 
the mapping of nodes in a hierarchically-structured docu- 
ment based upon steps and levels. The following algo- 
rithm, based on a set of rules, is used in such a mapping: 



[0038] 

For every descendant 

Level = Level + 1 

For every ancestor 

Level = Level - 1 

For every [>lst] child node 

Step=Step+l 

[0039] Next, as in Figure 6, the mapped nodes are grouped in a 
plurality of regions: Rl 600, R2 602, R3 604, R4 606, R5 
608, R6 610, and R7 612. Specific grouping patterns are 
used for illustrative purposes only, and it should noted 
that other groupings of the same set of nodes are envi- 
sioned (as illustrated in Figures 3a-c) in conjunction with 
the present invention. Also, as mentioned earlier, specific 



groupings of regions can depend on anticipated access 
patterns and usage. 
[0040] The regions of Figure 6 are ordered based on ascending 
Min Step and ascending Min Level. Table 1, provided be- 
low, depicts a list of regions ordered based upon ascend- 
ing Min Step and Min Levels. 

[0041] 



Region 


Nesting Level 


Min Step 


Min Level 


Max Step 


Max Level 


Rl 


0 


0 


0 


2 


2 


R2 


0 


1 


3 


3 


4 


R3 


0 


2 


5 


3 


6 


R4 


0 


4 


1 


5 




R5 


0 


6 


2 


9 


2 


R6 


0 


6 


3 


8 


4 


R7 


0 


10 


2 


12 


2 



Table 1 



[0042] Regions are then grouped into Step Ranges. Regions 

within a Step Range have Min Step and Max Step values 
that do not overlap with that of regions in other Step 
Ranges. The algorithm below illustrates how to compute 
the Step Ranges in a set of regions: 

[0043] 

Examine all Regions in order, 

If Min Step > Max Step Seen So Far 



Assign Current Step Range — Min Step 



[0044] Table 2, provided below, depicts Step Ranges (calculated 
based upon the above-mentioned algorithm) of regions of 
Figure 6 and Table 1. 



Region (Parameters) 


Step Ranges 


Rl [0,0,0,2,2] 


0 


R2[0,l,3,3,4] 


0 


R3[0,2,5,3,6] 


0 


R4[0,4,l,5,3] 


4 


R5 [0,6,2,9,2] 


6 


R6[0,6,3,8,4] 


6 


R7[0,10,2 5 12,2] 


10 



Table 2 



[0046] The relationship among the calculated step ranges and 
the various regions of Figure 6 (i.e., Rl 600, R2 602, R3 
604, R4 606, R5 608, R6 610, and R7 612) are shown in Fig- 
ure 7. In Figure 7, SR6 represents regions that have a step 
range equal to 6, which in this case encompasses regions 
R5 (608 of Figure 6) and R6 (610 of Figure 6). 

[0047] Figure 8 illustrates an example that represents a variation 
of the example illustrated in Figure 6, wherein the varia- 
tion represents added nodes (and therefore, added re- 
gions). In addition to the regions of Figure 6, Figure 8 fur- 
ther includes regions R8 802, R9 804, R10 806, and Rll 
808. The addition of these regions (R8-R11) changes the 
Step Ranges associated with all regions depicted in Figure 



8. Table 3, provided below, depicts Step Ranges 
(calculated based upon the above-mentioned algorithm) 
of regions of Figure 8. 



Region (Parameters) 


Step Ranges 


Rl [0,0,0,2,2] 


0 


R2[0,l,3,2,4] 


0 


R8[0,2,5,3,6] 


0 


R3 [0,2,7,3,8] 


0 


R9[0,3,3,4,5] 


0 


Rll [0,5,4,7,5] 


5 


R10[0,8,3,10,5] 


8 


R4[0,ll,l,12,3] 


11 


R5[0,13,2,16,2] 


13 


R6[0,13,3,15,4] 


13 


R7[0,17,2,19,2] 


17 



Table 3 



[0049] Figure 9 illustrates an example that represents another 
variation of the example illustrated in Figure 6, wherein 
the variation represents different ordering of regions us- 
ing the same set of nodes. The new set of regions of Fig- 
ure 9 includes: Rl 900, R2 902, R3 904, R4 906, R5 908, R6 
910, and R7 912. The regions of Figure 9 are ordered 
based upon ascending Min Step and ascending Min Level. 
Table 4, provided below, depicts a list of regions of Figure 
9 ordered based upon ascending Min Step and Min Levels. 



[0050] 



Region 


Nesting Level 


Min Step 


Min Level 


Max Step 


Max Level 


Rl 


0 


0 


0 


1 


2 


R3 


0 


1 


3 


4 


3 


R2 


0 


2 


2 


4 


2 


R4 


0 


2 


4 


3 


6 


R5 


0 


5 


1 


7 


2 


R7 


0 


6 


3 


8 


4 


R6 


0 


8 


2 


12 


2 



[0051] once the Step Ranges are determined, the regions are re- 
ordered. The previous entries ordered based on Min Step, 
Min Level are now ordered based on Step Range, Min 
Level, Min Step. In the above example, regions R2 902, R3 
904 and R6 910, and R7 912 are reordered. This change ac- 
curately reflects the parent-child relationship of the re- 
gions within a Step Range. Table 5, provided below, de- 
picts a list of regions of Figure 9 reordered based upon 
Step Range, Min Levels, Min Step: 



[0052] 



Region 


Nesting Level 


Min Step 


Min Level 


Max Step 


Max Level 


Rl 


0 


0 


0 


1 


2 


R3 


0 


1 


3 


4 


3 


R2 


0 


2 


2 


4 


2 


R4 


0 


2 


4 


3 


6 


R5 


0 


5 


1 


7 


2 


R7 


0 


6 


3 


8 


4 


R6 


0 


8 


2 


12 


2 



Table 5 



[0053] Figure 10 illustrates changes in parameters of regions due 
to changes in node structure. When nodes added to or 
deleted from a region(s) are added to or deleted from a 
document, the Step Range, Min Level, Min Step, Max Level, 



and Max Step values of neighboring regions are affected. 
For regions in the same Step Range, retraversal of the re- 
gions needs to be performed to recompute the Min Level, 
Min Step, Max Level, and Max Step values. For regions in 
Step Ranges that follow, only the Min Step and Max Step 
values need adjustment. 

[0054] jo defer retraversal of regions during modifications, re- 
gions affected can be assigned to another nesting level. 
When this happens, the original set of regions retains its 
dimensions with respect to other regions in the same 
nesting level while a new set of regions with a higher 
nesting level is created. 

[0055] For example, as shown in Figure 11, nodes were added to 
region R2 of Figures 6 and 7 which modified its dimen- 
sions and caused other regions R2 (of nesting level 1) 
1100, R8 1102, R9 1104, and RIO 1106 to be created. The 
index used to maintain the order of the regions still pre- 
serves the order of R2 1108 with respect to R3 1109 in 
nesting level 0, but now R2 1110, R9 1112, R10 1114, and 
R8 1116 of nesting level 1 are ordered in between (shown 
as tabbed entries in Figure 11). 

[0056] The regions of the new nesting level have Min Level, Min 
Step, Max Level, and Max Step values that are computed 



with respect to their parent nesting level region. For ex- 
ample, R8 1116 in nesting level 1 is two Levels and one 
Step away from the axis of R2 1108 in nesting level 0. It 
should be noted that, in Figure 11, the axis of R2 1100 
starts off a new Step count (from 0 to 6) and a new Level 
count (from 0 to 3) in nesting level 1. 
[0057] Further modifications to the document cause more re- 
gions of higher nesting levels to be created, all based of a 
parent level region. For example, as illustrated in Figure 
12, R9 of nesting level 1 1200 grows, causing R9 1202 and 
Rll 1204 of nesting level 2 to be created. Again, it should 
be noted that regions in ancestor nesting levels do not 
change. 

[0058] After the retraversal of regions within the Step Range and 
the adjustment of Min Step and Max Step values of the re- 
gions in the Step Ranges that follow, the regions of the 
document will be reordered, as shown in Figure 13. 

[0059] Traversal s that extract information from nodes require an 
order for each of the nodes in the document. Step and 
Level values can be associated with each of the nodes to 
impose an order for all nodes. These values are computed 
relative to the Step and Level values of the containing re- 
gion. Modifications in the document only affect the region 



coordinates which are maintained in a Region Index — 
not the node coordinates. In the example illustrated in 
Figure 14a, the coordinate of the H node in Region R2 
with Step 2, Level 3 is Step 0, Level 1 relative to the re- 
gion. Therefore, H nodes" absolute coordinate is Step 2, 
Level 4. Figure 14b illustrates original and region-relative 
coordinates associated with nodes in Regions R1-R4. 
[0060] Figure 15 illustrates a scenario wherein post-order traver- 
sal (POT#) numbers can be computed for each of the 
nodes during traversal to identify containment relation- 
ships among nodes. Given the Step, Level, and POT# val- 
ues of two nodes, the system can determine whether one 
node contains the other node. In the example below, the 
node with Step 5, Level 2, POT# 19 contains the node with 
Step 7, Level 4, POT# 13; but the node with Step 10, Level 
1, POT# 24 does not contain the node with Step 7, Level 
4, POT# 13. The algorithm for identifying if a node Nl 
contains another node N2 is given below: 

[0061] 



LETPOT# = Post Order Traversal Number 
LEVEL = Node Level Coordinate 
STEP = Node Step Coordinate 

FOR EVERY Node Nl AND Node N2 

IF (Nl POT# > N2 POT#) AND 
(Nl LEVEL < N2 LEVEL) AND 
(Nl STEP <= N2 STEP) 

THEN 

Nl CONTAINS N2 



[0062] it should be noted that although algorithms identified 

above (e.g., in the discussions of Figure 5a-5b) are based 
on a set of rules associated with how steps are computed, 



other variations of computing steps are also within the 
scope of the invention. For example, the algorithm de- 
scribed in relation to Figure 5a-b has problems with up- 
dates, as it could move nodes from one region out into 
another region during updates. For example, if the docu- 
ment were a, b, c, and d, where a->b, and b->c and b- 
>d, then a, b, and c would be assigned step 0 and d as- 
signed step 1; but during updates, if c were removed, d 
would need to be moved up one step, potentially into a 
different region from where it was originally. A simpler 
way of "counting" steps is to monotonically increase the 
step number every time a child node is descended. That 
is, if the document is a->b->c, b->d, a->e, then the step 
numbering would be a (1), b (2), c (3), d (4), e (5). This 
step numbering scheme is the same as that for preorder 
traversal of a tree of nodes and is illustrated in Figures 
16a-f. 

[0063] Figures 16a-b illustrate a set of nodes and their respec- 
tive mapping based on the above-mentioned method of 
assigning a step number every time a child node is de- 
scended. Figure 16c illustrates regions Rl through R6 
formed from the set of nodes wherein the parameters of 
Rl through R6 are provided below: 
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0 


26 
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[0065] Figures 16d-f illustrate the effect of modifications on the 
above-mentioned parameters with respect to regions Rl 
through R6. Thus, as can be seen in the examples illus- 
trated in Figures 16a-f, the specific algorithm used to 
compute the steps can vary and, hence, should not be 
used to limit the scope of the present invention. 

[0066] Similarly, regions can be grouped in a varying fashion. For 
example, as in Figure 17, nodes can be grouped based 
upon node descendant regions. Node descendant regions 
are regions that contain all nodes that are descendents of 
a particular node. Figure 17 illustrates Node Descendant 
Regions NDR1-NDR4, whose parameters are summarized 
below: 



[0067] 



Node Descendent Region 


Min Step 


Min Level 


Max Step 


Max Level 


NDR1 


2 


2 


2 


2 


NDR2 


5 


2 


12 


6 


NDR3 


17 


2 


18 


3 


NDR4 


19 


2 


21 


4 



[0068] Thus, as can be seen in the example above, various rules 
can be used to identify regions among a set of nodes and, 



hence, such rules should not be used to limit the scope of 
the present invention. 
[0069] Additionally, the present invention provides for an article 
of manufacture comprising computer readable program 
code contained within, implementing one or more mod- 
ules for ordering nodes in a document (e.g., XML docu- 
ment). Furthermore, the present invention includes a 
computer program code-based product, which is a stor- 
age medium having program code stored therein which 
can be used to instruct a computer to perform any of the 
methods associated with the present invention. The com- 
puter storage medium includes any of, but is not limited 
to, the following: CD-ROM, DVD, magnetic tape, optical 
disc, hard drive, floppy disk, ferroelectric memory, flash 
memory, ferromagnetic memory, optical storage, charge 
coupled devices, magnetic or optical cards, smart cards, 
EEPROM, EPROM, RAM, ROM, DRAM, SRAM, SDRAM, or any 
other appropriate static or dynamic memory or data stor- 
age devices. 

[0070] implemented in computer program code-based products 
are software modules for: (a) parsing said document; (b) 
creating nodes representing entities of the document and 
relationships that exist among the entities; (c) mapping 



the created nodes based upon a level and step associated 
with each of the nodes; (d) grouping the mapped nodes 
into a plurality of regions, wherein the grouping identifies, 
for each of the regions, at least the following parameters: 
a minimum step, a minimum level, a maximum step, a 
maximum level, said parameters giving a region its di- 
mension and order within the document; (e) ordering the 
regions based upon ascending minimum step and mini- 
mum level; (f) calculating the step range associated with 
each of the regions; and (g) reordering the regions based 
upon ascending step range, minimum level, and minimum 
step, wherein the reordering reflecting parent-child rela- 
tionships among said nodes. 
Conclusion 

[0071] a system and method have been shown in the above em- 
bodiments for the effective implementation of a method 
and system for ordering nodes via isolated ordered re- 
gions node order. While various preferred embodiments 
have been shown and described, it will be understood that 
there is no intent to limit the invention by such disclosure 
but, rather, it is intended to cover all modifications and 
alternate constructions falling within the spirit and scope 
of the invention as defined in the appended claims. For 



example, the present invention should not be limited by 
type of hierarchically-ordered document, the type of algo- 
rithm used to calculate the step, number of nodes, num- 
ber of levels, number of steps, number and shape of re- 
gions, software/program, or computing environment. 
[0072] The above enhancements are implemented in various 

computing environments. For example, the present inven- 
tion may be implemented on a conventional IBM PC or 
equivalent, multi-nodal system (e.g., LAN) or networking 
system (e.g., Internet, WWW, wireless web). All program- 
ming, GUIs, display panels and dialog box templates, and 
data related thereto are stored in computer memory, 
static or dynamic, and may be retrieved by the user in any 
of: conventional computer storage, display (i.e., CRT) and/ 
or hardcopy (i.e., printed) formats. The programming of 
the present invention may be implemented by one of skill 
in the art of mark-up-based languages and database pro- 
gramming. 



